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The MSTW2008 PDFs [1] were recently updated with a fit in the same general framework - 
the MMHT2014 PDFs [2], These were an improvement due to a number of developments in theory 
or procedures. For example, we now use extended parameterisation with Chebyshev polynomials, 
and freedom in deuteron nuclear corrections introduced in [3], which led to a change in the uy — 
dy distribution. We use the “optimal” GM-VFNS choice [4] which is smoother near to heavy 
flavour transition points, particularly at NLO. Correlated systematic uncertainties are now treated 
as multiplicative not additive. We have also changed the value of the charm branching ratio to 
muons used to B^ = 0.092 ± 10% [5]. There are also a wide variety of new data sets included in 
the fit. This includes LF1C (ATLAS, CMS and LFICb) W. Z cross sections differential in rapidity, 
and Drell Yan at high and low mass, and also data on a t j from the Tevatron and from ATLAS and 
CMS. At NLO we also include ATLAS and CMS inclusive jet data, though do not yet include 
these data at NNLO. Previous analyses have used threshold corrections for Tevatron data and we 
continue to include these data, but at the LHC we are often far from threshold. The full NNLO 
calculation [6, 7] is nearing completion. 
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Figure 1: A comparison of x 2 per point for the three variations of NLO and NNLO fits, all with fixed lower 
Q 2 • = 2 GeV 2 , for different D 2 ■ on the HERA data used in the calculations. 
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There are also various changes in non-LHC data sets, e.g. we include some updated Tevatron 
asymmetry data sets. The single most important change in data included is the replacement of 
the F1ERA run I neutral and charged current data provided separately by HI and ZEUS with the 
combined HERA data set [8] (we also include HERA combined data on F 2 c (x,<2 2 ) [9]), which is 
the data set which provides the best single constraint on PDFs, particularly the gluon at all x < 0.1. 
However, we decided not to include any separate run II HI and ZEUS data sets, since it was clear 
the run I + II combination data would soon appear. This has now happened, and the data, and the 
accompanying PDF analysis, are in [10]. It was left as an open question in [2] when an update of 
MMHT2014 PDFs would be required. Significant new LHC data would be one potential reason, 
and the full NNLO calculation of the jet cross sections might be another. However, the potential 
impact of the final HERA inclusive cross section data was another factor, it being possible that these 
alone might produce a significant change in the central value of the PDFs and their uncertainties. 
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Hence, it is now obviously a high priority to investigate their impact. 



Figure 2: The variation in y 2 per point when increasing the Q^ m cut, on only HERA Run I + II data, at 
both NLO (left) and at NNLO (right). 


Using our standard cut of <2" lin = 2 GeV 2 there are 1185 HERA data points with 162 corre¬ 
lated systematics and 7 procedural uncertainties. These are separated into 7 subsets, depending on 
whether it is e + or e scattering from the proton, whether it is neutral or charged current scatter¬ 
ing and on the proton beam energy E p . This is to be compared to 621 data points, separated into 
5 subsets, with generally larger uncertainties, from the HERA I combined data used previously 
(though this has fewer correlated systematics). We first simply investigate the fit quality from the 
predictions using MMHT2014 PDFs. 1 This is already rather good: 

Xnlo = 1611/1185 = 1.36 per point. 

Xnnlo = 1503/1185 = 1.27 per point. 

In contrast HERAPDF2.0 PDFs, which are fit to (only) these data obtain ~ 1.20 per point with 
<2min = 2 GeV 2 at both NFO and NNFO. Next we try refitting in the context of our standard global 
fit, i.e. we simply replace the previous HERA run I data with the new run I + II combined data. 
The fit improves to 

Xnlo = 1533/1185 = 1.29 per point, with deterioration Ax 2 = 29 in other data. 

Xnnlo = '457/1185 = 1.23 per point, with deterioration Ay 2 = 12 in other data. 

This is a significant, but hardly dramatic improvement (less than the improvement after refitting 
when HERA run I combined data were first introduced into the MSTW2008 fitting framework), 
i.e. the MMHT2014 PDFs were already giving quite close to the best fit within the global fit 
framework. In order to compare more directly with the HERAPDF2.0 study we also fit only HERA 
run I -f II data. This requires us to fix 4 of our PDF parameters in order to avoid particularly unusual 
PDFs, in practice a very peculiar, and potentially pathological strange distribution - HERA data not 
having any direct constraint on this. We allow the strange + antistrange distribution to have a free 
normalisation and high-x power, but all other shape freedom is removed. The strange-antistrange 
asymmetry is fixed to the MMHT2014 default values. The result is 

Xnlo = 1416/1185 = 1.19 per point 

*Our numbers are very slightly different to those presented at EPS2015 due to the correction of a minor bug. 
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Xnnlo = 1381/1185 = 1.17 per point 

Hence, in this case, as well as the global fit, the NNLO fit quality is definitely better than NLO. 
Part of this is due to the charged current data y 2 which is 20 units better in HERA data only fits 
than the global fits and 10 units better at NNLO than at NLO. There appears to be some tension 
between these charged current data and other data in the global fit, though this is partially resolved 
at NNLO. 



no. points 

NLO Xhera 

NLO Xglobal 

NNLO Xhera 

NNLO* 2 oM 

correlated penalty 


79.9 

113.6 

73.0 

92.1 

CC e+p 

39 

43.4 

47.6 

42.2 

48.4 

CC e~p 

42 

52.6 

70.3 

47.0 

59.3 

NC e~p E p = 920 GeV 

159 

213.6 

233.1 

213.5 

226.7 

NC e + p E p = 920 GeV 

377 

435.2 

470.0 

422.8 

450.1 

NC e+p E p = 820 GeV 

70 

67.6 

69.8 

71.2 

69.5 

NC e-p E p = 460 GeV 

254 

228.7 

233.6 

229.1 

231.8 

NC e-p E p = 920 GeV 

204 

221.6 

228.1 

220.2 

225.6 

total 

1145 

1342.6 

1466.1 

1319.0 

1403.5 


Table 1: The r 2 for each subset of HERA I + II data for the four variations of fit for Q 2 - = 3.5 GeV 2 . 

** ^min 

The HERAPDL2.0 analysis sees a distinct improvement in y~ per point with a raising of the 
<2^ in value for the data fit. Hence, we also investigate the variation of the fit quality with (9 2 im . 
However, first we simply calculate the quality of the comparison to data as a function of (9 2 in at 
NLO and at NNLO without refitting, i.e. the PDLs are obtained with the default <2 2 lin = 2 GeV 2 
cut. This is shown in Lig. 1 where we show a comparison of y 2 P er point for the three variations of 
NLO and NNLO comparisons, i.e. MMHT2014 prediction, global refit with new HERA data and 
refit with only HERA run I + II combined data. Here it is clear that NNLO is superior, but this is 
less distinct in the refits, particularly the fit to only HERA data. There is a reasonable lowering of 
the y 2 per point as Q 2 iin increases. 

We also look at the effect of changing the Q 2 cut in the fit (on only the HERA combined 
data), at both NLO and NNLO. This is shown in Pig. 2, where we also show the trend for the 
HERAPDP2.0 analysis [10]. 2 Lor comparison we also include the curves for the y 2 per point with 
varying Q] mn but with the fits performed for <2 2 lin = 2 GeV 2 . In fact we note that while there is 
an improvement in y 2 per point with increasing <2^ in , this is very largely achieved without any 
refitting. There is also less improvement in our analysis than for HERAPDP2.0, particularly in the 
global fit and at NNLO. In order to investigate the source of the improvement with increasing 0 2 lim 
we looked in more detail at the fit quality for low Q 2 bins. We found that there was quite a large 
degree of point-to-point fluctuation in theory/data at low Q 2 rather than any obvious systematic 
issue (though the lowest a: point at each Q 2 is often below theory, i.e. there is limited evidence of 
more of a turn-over in data than theory). In many cases it seems impossible to avoid some rather 
high y 2 points with any smooth curve. We present the detail of the breakdown of y 2 into each 

2 The definition of y} for HERAPDF2.0 is not identical, though this should be a very small effect. 
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Up valence (NNLO), percentage difference at Q 2 — 10 4 GeV 2 




Figure 3: The comparison of NNLO MMHT PDFs containing the new HERA data to MMHT2014 PDFs 
and the HERAPDF2.0 PDFs. 


of the data subsets in Table 1. We do this for Q^ in = 3.5 GeV 2 to allow direct comparison to the 
HERAPDF2.0 results, and also for possible comparison with e.g. NNPDF and CT results. 

In Fig. 3 we show the central values of the NNLO PDFs from the fits including the new HERA 
combined data, comparing them to MMHT2014 PDFs (with uncertainties) and the HERAPDF2.0 
PDFs (also with uncertainties). The modified PDFs are always very well within the MMHT2014 
uncertainty bands. Indeed, the predictions for e.g. gg —> H change by < 0.2% for the full range of 
LHC energies. The PDFs from the HERA run I + II data only fit are in some ways similar to those 
of HERAPDF2.0, e.g. the up valence quark for x > 0.2, which show some significant deviations to 
the two global fits PDF sets. However, the features are not universal, with the gluon and the down 
valence being much more similar to MMHT2014 than HERAPDF2.0. This may be a feature of the 
differing parameterisations used in the two studies. 

We also investigate the effect of the new data on the uncertainties of the PDFs. In Fig. 4 
we compare the uncertainties for the NNLO PDFs including the HERA run I + II data to the 
MMHT2014 PDFS. These are quite similar to MMHT2014 in most features. The most obvious 
improvement is to the gluon for * < 0.01. There is also slight improvement in some places for the 
valence quarks. We understand that NNPDF results on central values and uncertainties are similar 
[11]. Hence, the HERA run I + II combined data gives us our single best constraint on PDFs, and 
determines the gluon at low v even more accurately than before, but its inclusion does not suggest 
updates of existing PDFs are immediately necessary. 
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Figure 4: The comparison of the uncertainty on NNLO MMHT PDFs from a global fit containing the new 
HERA data to that on the MMHT2014 PDFs. 
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